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Abstract 

In this paper, we develop new algorithms for the basic RNA folding problem. Given an RNA 
sequence that contains n nucleotides, the goal of the problem is to compute a pseudoknot-free 
secondary strncture that maximizes the number of base pairs in the sequence. We show that 
there exists a dynamic programming algorithm that can solve the problem in time „ ) 

while using only memory space. In addition, we show that the time complexity of 

this algorithm can be further improved to at the expense of a slightly increased space 

complexity. To the best of our knowledge, this is the first algorithm that can solve the problem 
with traditional dynamic programming techniques in time In addition, our results 

improve the best known upper bound of the space complexity for efficiently solving both this 
problem and the context-free language recognition problem. 
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1 Introduction 


An important problem in structural bioinformatics is to determine the secondary structure of an 
RNA molecule from its primary sequence of nucleotides [TTIIIIIIIIIIT]. Although the problem was 
originally proposed more than thirty years ago, it is still under intensive research and remains to be 
an important topic in structural bioinformatics. The problem has important applications in many 
areas of molecular biology including RNA structure prediction [TTl [H] , RNA design m and the 
study of complete genomes of RNA viruses [5]. An algorithm that can efficiently and accurately 
determine the secondary structure of an RNA sequence is thus highly desirable for researchers in 
these areas. 

Given an RNA sequence that contains n nucleotides, the goal of the basic RNA folding problem is 
to find a secondary structure that does not contain crossing base pairs and maximizes the number of 
base pairs in the sequence. The problem can be solved with a well established dynamic programming 
algorithm in O(n^) time and 0('n?) memory space [5]. 

Since this algorithm is available, researchers have made a large amount of efforts to further reduce 
the time and space complexities of the problem. In El HI], a few heuristics are developed to reduce 
the computation time of the dynamic programming algorithm in practice. In [I] , an algorithm that 


needs o(!lli 2 S 2 ^) 

log^ n 


time is developed to solve the problem. 


However, the algorithm is complex 


and difficult to implement. In [3], an additional parameter Z is introduced to describe an instance of 
the problem. Z is an integer between n and and the time and space complexities of the problem 
are shown to be 0{nZ) and 0{Z) respectively. However, the worst-case time and space complexities 
of the algorithm remain to be 0{n^) and Oiji?). Although it is observed in the same paper that 

the time complexity of the algorithm can be improved to 0 (^ combining it with a 
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new algorithm for computing all pairs of shortest paths in a graph [4], this combined algorithm is 
difficult to implement and its practical value is thus limited. 

Recently, significant progress has been made toward improving the worst-case time complexities 
of the basic RNA folding problem and a few other related problems. In [T3], a faster algorithm for 
matrix-vector min-plus multiplication is developed based on a certain discreteness assumption. It 
is shown that the worst-case time complexity of the basic RNA folding problem can be improved 
to O( iog 2 „ ) by combining this algorithm with the Valiant’s technique [TS]. In [5D], a few problem 
templates are developed and generic algorithms using Valiant’s technique are developed for problems 
that can be fit into these templates. Improved time complexity results for a few related problems 
are obtained based on this observation. 

On the other hand, new techniques have been developed to improve the computation time needed 
to solve the problem using only traditional dynamic programming techniques. In mils], it is shown 
that the problem can be solved in time with the Four-Russians technique. Four-Russians 

technique is an elegant approach that has been successfully used to reduce the worst-case time com¬ 
plexity of dynamic programming algorithms. For example, an early application of the technique |10j 
successfully reduces the worst-case time complexity of the longest common sequence problem from 
0{v?) to „ )■ Although it appears to be difficult to further reduce the time and space com¬ 

plexities of the dynamic programming approach without using Valiant’s technique. The successful 
application of the Four-Russians technique to the problem provides a new perspective on solving 
the problem with less computational resources. A question therefore arises naturally on whether 
the Four-Russians technique can be utilized to further reduce the amount of computation time and 
space needed to solve the basic RNA folding problem with only traditional dynamic programming 
techniques. 

Compared with the Valiant’s technique, traditional dynamic programming techniques are easier 
to implement in practice and can be easily extended to solve the RNA folding problem when the 
secondary structure may contain pseudoknots. It is therefore highly desirable to develop algorithms 
that are based on traditional dynamic programming techniques and also efficient in both time and 
space. 

In this paper, we develop new techniques to further utilize the Four-Russians technique to improve 
the computation time and space needed to solve the problem with traditional dynamic programming 
techniques. We start by showing that the problem can be solved with a dynamic programming 
algorithm in time while using only memory space. We then show that the 

time complexity of the algorithm can be further improved to Q( |pgy„ ) with a slightly increased 
space complexity. To the best of our knowledge, this algorithm is the first algorithm that can 
solve the problem in time without using Valiant’s technique. In addition, our results 

improve the known worst-case space complexity of the basic RNA folding problem, which is 0(jn?). 
One important advantage of our algorithms is that they are not difficult to implement. Another 
important observation is that our techniques can also be used to improve the worst-case space 
complexity needed to efficiently solve the context-free language recognition problem, which has been 
unchanged for more than thirty years. 


2 Preliminaries and Notation 

In general, an RNA molecule consists of a sequence of nucleotides. There are four types of nucleotides 
and we use letters A, C, G, U to denote them. In addition, we use S = {A, C, G, C/} to denote the 
set of all possible nucleotides a sequence may contain, an RNA sequence is considered to be a string 
formed by letters from E. A base pair is a pair of nucleotides that interact and form strong hydrogen 
bonds in the structure of an RNA sequence. 

Molecular biology has revealed that energetically stable base pairs are usually formed between 
A and U or G and G. Two nucleotides are complimentary if they can form an energetically stable 
base pair. Without loss of generality, we only consider these two possible types of base pairs in 
this paper. The secondary structure of an RNA sequence is comprised of a set of base pairs formed 
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by nucleotides in the sequence. Two base pairs in a sequence cross each other if they are neither 
nested nor in parallel in the sequence. Since the secondary structures of most RNA sequences do not 
contain crossing base pairs, the goal of the basic RNA folding problem is to compute a secondary 
structure that contains the maximum number of non-crossing base pairs. 

Given an RNA sequence, two nucleotides that form a base pair are usually separated by at least 
a certain number of nucleotides along the sequence. To simplify the presentation of our algorithm, 
we assume that any pair of complimentary nucleotides in a sequence can form a base pair. It is not 
difficult to see later that our algorithm can be slightly modified to guarantee the distance restriction 
for base pairs while maintaining its time and space complexities. 

The well known O(n^) time dynamic programming algorithm for this problem is similar to the 
CYK algorithm for context free language recognition. Specifically, given a sequence ^[l • • - n], we 
maintain a n x n table M such that stores the maximum number of non-crossing base pairs 

that can be formed in nucleotides in subsequence where 1 < i < j < n. Given two 

nucleotides a and 6, C[a,b] = 1 if a and b are complimentary, otherwise C[a,b] = 0. The recursion 
relation for computing M[i,j] is as follows. 

M[i,j] = max{M[i - lJ],M[i,j - l],M[i - l,j - 1] + j]} (1) 

where 5'[z] and 5'[j] are the nucleotides in positions i and j in S and N[i,j] can be computed as 
follows. 

N[i,j] = max {M[i,p] + M[p+ l,j]} (2) 

i<P<j 

It is not difficult to see that the computation of N[i,j] may need 0{n) time and the worst-case 
time complexity of the algorithm is thus 0{n^). Since the algorithm needs to maintain an n x n table 
M to store the intermediate computational results, its space complexity is 0{n^). It is clear that 
the computation of N[i,j] and the storage of each M[i,j] are the obstacles for us to further reduce 
its worst-case time and space complexities respectively, our major goal in the rest of the paper is 
therefore to develop methods that can compute the value of N[i,j] more efficiently while using less 
amount of memory space. 

We need to point out that the above analysis of the dynamic programming algorithm is based 
on the assumption that any positive integer that is at most n can be stored in one word of the 
computer system. Since otherwise, a single arithmetic step may need up to 0(log2 n) time and the 
worst-case time complexity of the algorithm increases from the original 0{n^) to 0{n^ \ 0 g 2 n). The 
same assumption is in fact also implied in the time complexity results obtained in most of the related 
work [BElllo]. In the rest of the paper, we refer to this assumption as the word width assumption 
and the analysis of the time complexity of our algorithm is also based on this assumption. 

3 The Memory Efficient Algorithm 

In this section, we show how the Four-Russians technique can be utilized to develop an algorithm 
that can solve the problem in time and memory space. In section HI we show 

how the time complexity of the algorithm can be further improved to 0( ”2 )■ Although the basic 

log2 n 

idea of our algorithm is similar to that of the algorithm developed in [^, additional techniques are 
used to reduce the space complexity of the problem from 0{n^) to O( iog^ „ )■ 

The basic idea of the Four-Russians technique is to divide the dynamic programming table 
into parts and precompute the possible contents in each part. The precomputation step is usually 
performed before the dynamic programming procedure starts and the results of the precomputation 
are stored in query tables. During the dynamic programming procedure, the content of each part can 
be efficiently obtained by querying the query tables with the information that has been previously 
obtained. In [6], it has been shown that precomputation and the dynamic programming procedure 
can be effectively interleaved and the time complexity of the dynamic programming procedure can 
be reduced from O(n^) to 
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Given an RNA sequence S'[l • • • n] that contains n nucleotides and integers i and j such that 
1 ^ ^ ^ we use M[i,j] to denote the maximum number of non-crossing base pairs that can be 

formed in subsequence S'[i ■ ■ ■ j]. The following lemma establishes the foundation of our algorithm. 
Although the fact stated in this lemma has been used in [B] as an obvious fact to develop the 
time algorithm, we provide a formal proof for completeness. 

Lemma 3.1 Let 1 < i < j < n, where n is the length of the sequence S, the following inequalities 
hold. 

1. M[iJ]<M[i,j + l] <M[i,j] + l 

2. M[i + lJ]<M[t,j]<M[i + l,j] + l 

Proof. We let I = M[i,j] and we use Stj to denote the set of I non-crossing base pairs that are 
formed in subsequence ■ ■ ■ j]. Similarly, we let m = M[i,j + 1] and Sij+i to denote the set of 
m non-crossing base pairs that are formed among nucleotides in subsequence If Sij+i 

contains a base pair p that forms between a nucleotide in S'[i ■ ■ ■ j] and S[j + 1], S'q^+i — {p} is a set 
of non-crossing base pairs formed among nucleotides in S[i • ■ ■ j]. Since M[i,j] is the largest number 
of non-crossing base pairs that can be formed in ■ j], we immediately obtain 


m — 1 < I (3) 

On the other hand, Sij is also a set of base pairs formed in • • • J + 1], we immediately obtain the 
following inequality from the definition of M[i,j + 1]. 

I < m (4) 

We therefore can conclude that 

M[i,j]<M[i,j + l]<M[i,j] + l (5) 

It is straightforward to see that inequality 

M[i + l,j] < M[t,j] < M[i + l,j] + 1 (6) 

holds for the same reason. The lemma thus follows. □ 


We partition the interval [1 • • - n] into k = -|- 1 disjoint intervals and each interval contains 

up to w consecutive integers, where w is an integer such that 0 < w < log 2 n. The value of w 
is determined later in the paper. At least A: — 1 of these intervals are of equal length and each of 
them contains w consecutive integers and the remaining one contains at most w consecutive integers. 
Specihcally, we use A to denote the interval [sw -|- 1, (s -I- l)w] for 0 < s < fc — 1 and Ik-i to denote 
the last interval [(fc — Ijw -I- l,n]. Each of these intervals is an aggregate interval. 

Given integers i and s such that 1 < i < sw-b 1 < n, M[i, sw-\-l],M[i, sw;-|-2], • • • , M[i, (s-bl)ic] 
form an (i,s) left vector. From Lemma 13.11 an (i,s) left vector can be completely determined by 
M[i, sw] and an s dimensional binary vector l{i, s) that only contains 0 or 1 in its components. l{i, s) 
is an {i,s) left difference vector. 

Similarly, given integers j and s such that 1 < (s -b l)w < j < n, M[sw + 1, j], M[sw -b 
,M[{s + l)w,j] form a (j, s) right vector. From Lemma l3.ll a (j,s) right vector can be 
completely determined by M[sw-b 1, j] and an s dimensional binary vector r(j, s) that only contains 
0 or 1 in its components. r(j, s) is a (j, s) right difference vector. A binary vector is a difference 
vector if there exists an integer pair (z, s) such that it is an (z, s) left difference vector or an (z, s) 
right difference vector. 

A difference vector can be considered to be the binary encoding of an integer u such that 0 < 
u < 2’"“'"^ — 1. Since w < log 2 zz, u can be stored in one word due to the word width assumption. 
Since there are in total 0{nk) difference vectors and each is associated with a pair of integers. We 
can store the integers encoded by them in two query tables that can be efficiently queried during 
the dynamic programming procedure. 
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Definition 3.1 Let S be an RNA sequence of length n. A left table Lt is an n x (fc — 1) table such 
that Lt[i,s\ stores the value encoded by the (z,s) left difference vector associated with S. 

Definition 3.2 Let S be an RNA sequence S of length n. A right table Rt is an n x (fc — 1) table 
such that Rt[j, s] stores the value encoded by the (j, s) right difference vector associated with S. 

Lt and Rt are the two tables that store the intermediate results that have been obtained during 
the dynamic programming procedure. However, they are not the tables that need to be computed 
during the precomputation stage of the Four-Russians technique. To clarify the goal of the precom¬ 
putation stage, we need the following definitions. 

Definition 3.3 Let S be an RNA sequence of length n, Iq, R, ■ ■ ■ , R-i are the aggregate intervals 
of length w in S. A central table Ct is an 2“+^ x 2™+^ table such that Ct can be accessed based on 
two integers u and v that satisfie 0 < m < 2“+^, 0 < ?; < 2“+^. Ct[u,v\ stores two positive integers 
whose values are at most w. 

Definition 3.4 Let S be an RNA sequence of length n, i, j and s are integers that satisfy 0 < s < 
fc — 1, 1 < i < sw and (s -I- l)ui < j < n. The (i-,j) representative of R is an integer r such that the 
following two conditions hold. 

1. 1 < r < w; 

2. for all p’s such that sw -|- 1 < p < (s -I- l)^, M[i,p] -\- M[p-\-\R] is maximized when p = ws -\- r. 

Definition 3.5 Let S be an RNA sequence of length n, i, j and s are integers that satisfy 0 < s < 
k — 1, 1 < i < sw and (s -I- l)rc < j < n. r is the (i, j) representative of R. The {i,j) deviation of 
R is the value of M[i, r] -|- M[r + 1, j] — M[i, sty -|- 1] — M[sty -I- 2, j]. 

Lemma [3.21 shows that a central table Ct can be precomputed such that both the (t,j) repre¬ 
sentative and deviation of an aggregate interval Is can be obtained in 0(1) time by querying Ct 
with the two integers encoded by the (t, s) left difference vector and (j, s) right difference vector 
respectively. 

Lemma 3.2 Let S be an RNA sequence of length n, i, j and s are integers that satisfy 1 < f < sry 
and (s -I- l)ry < j < n. Both the (f, j) representative and (f, j) deviation of R can be determined by 
the (i, s) left difference vector and (j, s) right difference vector. 

Proof. We use U to denote the (i, s) left difference vector and V to denote the {j, s) right difference 
vector. For 1 < <7 < re, u{q) and v{q) are the gth components of U and V respectively. U{q) is the 
value of M[i, sw + q] — M[i, sre -|- g — 1] and V{q) is the value of M[sw + q,j] — M[sw + q + 1, j]- 
From Lemma [3.11 the value of U{q) or V{q) is either 0 or 1. 

Let T = M[i,sw] + M[sw + l,j], it is not difficult to see that the following equality holds for 
1 <t < w. 

t 

M[i, sry -I- t] -I- M[sw + t + l,j] = T + ^ {U{c) — V{c)) (7) 

C=1 

Both the {i,j) representative and deviation of R can thus be found based on the components of 
U and V. The lemma thus follows. □ 

From Lemma |3.21 both the {i,j) representative and deviation of R only depend on the left and 
right difference vectors associated with (i, s) and (j, s). The two vectors are both ry-dimensional bi¬ 
nary vectors. In the precomputation stage, our algorithm thus enumerates all possible combinations 
of two ly-dimensional binary vectors and computes the representative and deviation associated with 
each possible combination and store their values in the central table. 

Specifically, the algorithm constructs a central table Ct and enumerates all possible combinations 
of two ly-dimensional binary vectors U and V. For each such combination, we use e„ and Cy to denote 
the values of integers encoded by U and V respectively. The corresponding representative r(eu,ey) 
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and deviation d{eu, e«) can be computed based on the components of U and V. The algorithm then 
stores r(e„,e«) and d{eu,ev) into Ct at Ct{eu,ev\. 

In addition to the tables we have described above, three n x ^ tables 13, E, G and one f x n 
table F are needed to store the intermediate results of dynamic programming. D is a table that 
stores the values of M[i,j]’s for all i and j’s that are in the same aggregate interval. D can be 
accessed by two integers. One is the starting position of a subsequence and the other one is the 
length of the subsequence, if is a table that stores the values of M[i, j]’s for all j’s that are the first 
position of an aggregate interval. E can be accessed by two integers. One is the starting postion of a 
subsequence and the other one is the integer identity of an aggregate interval, its first position is the 
ending position of the subsequence. G is a table that stores the value of M[i,j] for all j’s that are in 
the same aggregate interval. G is accessed by the starting position of a subsequence and the relative 
position of the ending position of the subsequence in the aggregate interval that contains the ending 
position. F is a table that stores the values of M[i,j]’s for all i’s that are the second position of an 
aggregate interval. Similarly, F can also be accessed by two integers. One is the integer identity of 
an aggregate interval whose second position is the starting position of a subsequence and the other 
one is the ending position of the subsequence. 

The algorithm computes the values of M[i,j] in the same order as the one used in [^. It follows 
the steps that are sketched below to compute the value of M[i,j] for each pair of integers i and j 
that satisfy 1 < i < j < n. 

1 . Precompute the central table Gt as described above; 

2 . set j to be 1 ; 

3. set i to be j — 1; 

4. compute the value of M[i,j] and store related values into the corresponding tables; 

5. decrement i by 1; 

6 . go to step 4 if i is larger than 0; 

7. increment j by 1; 

8 . go to step 3 if j is not larger than n; 

9. output M[l,n] and the base pairs associated with M[l,n]. 

We now describe the implementation of step 4 in detail. We consider three possible cases for 
integers i, j and the computation of M[i,j] is based on a different strategy for each case. The three 
cases are listed as follows. 

1 . i and j are in the same aggregate interval G, where 0 < e < k] 

2 . there exists integer 1 < e < k such that i is in aggregate interval G-i and j is in aggregate 
interval G; 

3. there exists a set of aggregate intervals between i and j. 

For case 1, the algorithm computes the value of M[i,j] with the same procedure as the one in 
the traditional dynamic programming algorithm. Specifically, the value of M[i,j] can be computed 
by the following steps. 

1. Set Cl to be D[i,j — i]; 

2. set C 2 to be D[i + 1, j — i]; 

3. set C 3 to be D[i + 1, j — i — 1] + G[S'[i], S[j]]] 

4. exhaustively enumerate all integers p’s that satisfy i < P < j and set C 4 to be the maximum 
value of D[i,p — i + 1] + D[p + 1, j — p]; 
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5. set D[i, j — i + V\ to be max {ci, C 2 , C 3 , C 4 } 

6 . set to be D[i,j — i + 1], 

where N is an array that stores the values oi for all i’s that satisfy 1 <i < j- Similar to case 

1, the computed value of M[i,j] is also stored in iV[i] in cases 2 and 3. 

For case 2, the algorithm computes the value of M\i, j] with the same exhaustive search procedure 
as we have described for case 1. However, since i and j are in two different but consecutive aggregate 
intervals. Rt needs to be queried to obtain the intermediate values needed for the exhaustive search. 
Based on the two aggregate intervals le-i and Ig that contain i and j respectively. The steps for 
computing M[i,j\ are described as follows. 

1. Set Cl to be N[i + 1]; 

2. set X to be Rt[j — 1, e — 1]; 

3. obtain the right difference vector associated with x and store its components in array X] 

4. set C 2 to be F[e — 1, j — 1] — if * > (e — l)ru + 2; set C 2 to be F[e — l,j — 1] 

if I = (e — l)it; + 2; set C 2 to be F[e — 1, j — 1] + V{1) if z = (e — l)z(; + 1; 

5. set C 3 to be C 2 — H(z — (e — l)w) + C[S'[z], 

6 . set V = Rt[j, e — 1]; 

7. obtain the right difference vector associated with v and store its components in array V ; 

8 . exhaustively enumerate all p’s that satisfy i < p < ew — 1 and set C 4 to be the maximum value 

of D[z,p - i + 1] + F[e - 1, j] - Viiy, 

9. if z > (e — l)zz; + 2, exhaustively enumerate all g’s that satisfy ew < q < j and set C 5 to be 

the maximum value of D[q + l,j — q] + F[e — 1, g] — E(0; where Vq stores the 

components of the right difference vector associated with Rt [q, e — 1 ]; 

10 . if z = (e — l)z(; + 2 , exhaustively enumerate all g’s that satisfy ew < q < j and set C 5 to be the 
maximum value of D[q + 1, J — <z] + F[e — !,<;]; 

11 . if z = (e — l)z(; + 1 , exhaustively enumerate all q's that satisfy ew < q < j and set C 5 to be the 
maximum value oi D[q + j — q\ + F[e — 1, q\ + Vq(l), where Vq stores the components of the 
right difference vector associated with e — 1 ]; 

12 . set A^[z] to be max{ 01 , 02 , 03 , 04 , 05 }; 

13. set G[i,j — ew] to be N[i]. 

For case 3, there exists a few consecutive aggregate intervals between z and j, the computation 
of M[i,j] needs to query both Lt and Rt to efficiently obtain the {i,j) representative and deviation 
for each such aggregate interval. Specifically, we assume z, j are contained in aggregate intervals Im 
and In respectively. The steps that compute M[z, j] are described as follows. 

1. Set oi to be iV[z + 1]; 

2. set X to be Rt[j — l,m]; 

3. obtain the right difference vector associated with x and store its components in array X; 

4. set 02 to be F[m, J — 1] — ^(0 if * > + 2; set 02 to be F[m, j — 1] if z = mw + 2; 

set 02 to be F[m,j — 1] + y(l) if z = mw + 1; 

5. set 03 to be 02 — H(z — mw) + C'[S'[z], 

6 . set V = Rt[j, zn]; 
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7. obtain the right difference vector associated with v and store its components in array V ; 

8 . exhaustively enumerate all p’s that satisfy i < p < (jn + l)r/; — 1 and set C 4 to be the maximum 

value of D[i,p — i + 1] + F[m,j] — ^(Oi 

9. if i > mw + 2, exhaustively enumerate all g’s that satisfy ew < q < j and set C 5 to be the 

maximum value of D[q+l,j — q] + F[m,q] — ^^ 9 ( 0 ) where Vq stores the components 

of the right difference vector associated with Rt [g, m] ; 

10 . if z = mw + 2 , exhaustively enumerate all q’s that satisfy ew < q < j and set C 5 to be the 
maximum value oi D[q + 1, j — q] + F[m, g]; 

11 . if z = mw + 1 , exhaustively enumerate all g’s that satisfy ew < q < j and set C 5 to be the 
maximum value of Zl[g + 1 , j — g] + F[e — 1 , g] + 1 ^( 1 ), where Vq stores the components of the 
right difference vector associated with Rt[q,m]] 

12. exhaustively enumerate all /’s that satisfy TO+l</<rz — 1 and execute steps 13 to 17 for 
each /; 

13. set X to be Rt[j, /]; 

14. set y to be Lt[i, /]; 

15. query Ct with x, y and store the deviation stored in Ct[x,y] into d; 

16. store the value of d + E[i, f] + F[f,j] into an array id; 

17. set C 6 to be the largest number in id; 

18. set iV[z] to be max{ci, 02 , 03 , 04 , 05 , 06 }; 

19. set G[i,j — nw] to be A^[z]. 

After the values of all M[z,j]’s are computed, the algorithm computes the (j, s) right difference 
vectors for each s that satisfies (s + l)w < j and write the integer encoded by it into Rt[j, s]- The 
computation of all these right different vectors can be based on array N. In addition, when j = gw 
for some integer g, the algorithm computes the (z, g) left difference vector for each z < {g — l)w based 
on G and stores the integer encoded by it into Lt[i, g\. Let w = ^ log 2 zz, we immediately obtain the 
following theorem. 

Theorem 3.1 Given an RNA sequence S of length rz, the algorithm correctly computes the value 
of M[z, j] for all integers z and j that satisfy 1 < z < j < zz in time and space ^ ). 

Proof. It is straightforward to see that the algorithm correctly computes in cases 1 and 2 

since an exaustive search among all integers between z and j is performed in both cases to compute 
M[i,j]- In case 3, the algorithm queries Gt with the integers encoded by the (z,s) left difference 
vector and (j, s) right difference vector to compute the value of (z,j) deviation for each aggregate 
interval R between z and j. From Lemma l3.1l and Lemma 13.21 the value of M[i,j] can be correctly 
computed based on the deviations obtained by querying Gt- In addition, the algorithm correctly 
updates the content of Lt and Rt after all M[z, j]’s have been computed for each j. The correctness 
of the algorithm thus follows. 

We then analyze the time complexity of the algorithm. The precomputation stage computes the 
central table Ct ■ Ct contains 2^’" locations in total and the number in each location can be computed 
in time 0{w). The precomputation stage thus needs at most 0(2^“'zc) time. Next, we consider the 
total number of pairs (z,j) for which M[i,j] needs to be computed, where 1 < i < j < n. It is 
not difficult to see that 0{nw) of these pairs are processed as case 1 or 2 and the remaining pairs 
are processed as case 3. For each pair that is processed as case 3, Ct needs to be queried for O(^) 
times and the time needed to compute the value of M\i,j] is thus O(^). In addition, the content of 
Rt can be updated in time 0(n) for a fixed j. Similarly, the content of Lt can be updated in time 




0{nw) and it needs to be updated for up to O(^) times in total. The total computation time T(n) 
of the algorithm thus can be bounded from above as follows. 

T(n) < + — + n^) ( 8 ) 

w 

where A is some positive constant. The algorithm thus needs time if w = ;| log 2 n. 

The amount of memory space needed by Rt, Lt, D, E, G, F are O(^). Ct can be implemented 
with 0(2^“') space. N needs 0{n) space and can be reused for each j. The total amount of space 
S{n) needed by the algorithm can thus be bounded from above as follows. 

S{n)<B{— + 2^'^+n) (9) 

w 

where B is some positive constant and the space complexity of the algorithm is if w = 

i log 2 n. □ 

The algorithm needs to implement a trace-back procedure to obtain the base pairs in the predicted 
secondary structure of the sequence. However, the trace-back procedure of the 0{n^) traditional 
dynamic programming algorithm cannot be directly used since the algorithm does not maintain 
an n X n table that stores the information needed for tracing back. We show that the trace- 
back procedure can be performed with a recursive algorithm in time O(n^) and 0{n) additional 
space. We use TRACEBACK to name this algorithm and it needs two parameters i and j, where 
^ j ^ n. It returns the base pairs contained in the secondary structure that corresponds to 

M[i,j] in subsequence S[i ■ ■ ■ j]. For the convenience of notation, we use Tr{i,j) to denote the set 

of base pairs returned by TRACEBACK on subsequence S[i ■ ■ ■ j]. The steps of TRACEBACK can 
be described as follows. 

1. Return an empty set if i > j or i = j; 

2. obtain the values of M[i,j], M[i,j — 1], M[i + 1, j], and M[i + l,j — 1] and store them into c. 
Cl, C 2 , and C 3 by querying tables Rt and F; 

3. obtain the values of M[i,p] and M[p + l,j] for each p that satisfies i < p < j and store their 
values into arrays O and Q by querying Lt, Rt, E, F and D; 

4. if c = Cl, recursively call TRACEBACK on i and j — 1 and return Tr{i,j — 1); 

5. if c = C 2 , recursively call TRACEBACK on z - 1 - 1 and j and return Tr{i + 1, j); 

6 . set C 3 to be C 3 -I- ^[j]] 

7. if c = C 3 , recursively call TRACEBACK on z-|- 1 and j — 1 and return {{i,j)}UTr{i + l,j — 1); 

8 . for each p that satisfies i <p < j, find the integer p that satisfies c = 0[p]-\- Q[p 

9. recursively call TRACEBACK on i and p and store the returned result in Sf, 

10. recursively call TRACEBACK on p -|- 1 and j and store the returned result in Sr] 

11. return S'; U S'r. 

Theorem 3.2 Algorithm TRACEBACK returns the base pairs in the predicted secondary structure 
of S in time 0{v?) and needs 0{n) additional space. 

Proof. The correctness of the algorithm follows directly from Theorem l3.ll We use T{r) to denote 
the computation time needed by TRACEBACK on a subsequence of length r. Since the computation 
of array O, Q and integer p can be performed in 0(r) time, the following inequality holds for T{r). 

T{r) < T{ri) + T{r — ri) + Ar (10) 
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where ri is an integer that satifies 1 < ri < r — 1 and A is some positive constant. 

We define a positive constant Aq as follows. 

^o = inax{ max /2_1) 

l<ri<4 ri^ 

We then show T(r) < Aqt'^ — r for all r > 0 by induction. First, for tq < 4, the inequality holds 
from the definition of Aq. We now assume that the inequality holds for all integers that are at most 


r, where r > 4. For T{r + 1), we have 

T(r+1) < r(ri) + r(r+l-ri)+Ar (12) 

< Ao((r + 1)^ - 2ri(r + 1 - ri)) - (r + 1) + A(r + 1) (13) 

= (Ao(r + 1)^ - (r + 1)) - (2^o?’i(?'+ 1 - T"!) - ^(?’+ 1)) (14) 

< Ao{r + if - {r + 1) (15) 


where the second inequality is due to the assumption of induction and the last inequality comes 
from the fact that r is at least 4 and Aq > AA. The inequality thus holds for all r > 0 from the 
principle of induction. The inequality suggests that TRACEBACK needs 0{n?) time to determine 
all base pairs in the predicted secondary structure of S. 

It is not difficult to see that the space of array O and Q can be reused when TRACEBACK calls 
itself recursively. TRACEBACK therefore only needs 0{n) additional space for its computation. □ 

Putting Theorems 13.11 and 13.21 together, we immediately obtain the following theorem on the 
time and space complexities of the basic RNA folding problem. 

Theorem 3.3 The basic RNA folding problem can be solved in time 0( j^" ^ ) while using only 
O( iog ^ ) memory space, where n is the length of the sequence. 

It is worth pointing out that the above result for space complexity does not rely on the word 
width assumption. To see this, we assume the word width assumption does not hold and an integer 
between 0 and n may need 0(log2 n) words for storage. If this is the case, the algorithm can create 
n + 1 blocks of memory space to store the integers from 0 to n. For a location in any table used by 
the algorithm, a pointer is stored in the location. This pointer points to the block that stores the 
value of the integer that is originally stored directly in the location. It can be seen that the above 
strategy does not increase the amount of memory needed by the tables. Since the blocks used to 
store integers from 0 to n need only 0(n log 2 n) space, the space complexity of the algorithm remains 
to be O( iog^ „ )■ However, an additional 0(log2u) factor needs to be added to the time complexity 
of the algorithm since one step for integer arithmetic may need up to 0(log2 n) time. We therefore 
obtain the following corollary. 

Corollary 3.1 The basic RNA folding problem can be solved in time O(n^) while using only 
O( jog^„ ) space when the word width assumption does not hold, where n is the length of the sequence. 
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Improve the Time Complexity to 0( 



) 


In this section, we develop strategies to further improve the time complexity of the algorithm we 
have developed in the previous section. Specifically, the algorithm does not compute the value of 
M[i,j\ for each individual pair (j,j). Instead, for a fixed j, strategies based on the Four-Russians 
technique are used to determine the values of M[i,j] for all Fs that are in the same aggregate interval 
in 0(1) time. In other words, the values of M[i,j]’s are determined in a blockwise fashion for each 
fixed j and the computation time needed by the dynamic programming is thus further reduced to 
O( jog^„ )- However, additional memory space is needed to store the results of precomputation and 
the space complexity of the algorithm becomes 0(n^ ®). 
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For each pair {i,j), the algorithm we have developed in the previous section fully determines 
the value of M[i,j] before it starts computing M[i — 1, j]. To improve the time complexity of the 
algorithm, we observe that it does not have to follow this order of evaluation to compute the values 
of all M[i,j]’s. Instead, for a fixed j, the algorithm can process all aggregate intervals to the left 
of j in an order from right to left. A partial optimal result can be computed for each pair of (i, j) 
when an aggregate interval is processed. The values of all M[*,_j]’s are finally determined when all 
aggregate intervals to the left of j are processed. 

Definition 4.1 Let S be an RNA sequence of length n. i, j, t and o are integers that satisfy 
1 < i < t < o < j < n, a {t, o) partial optimal result P[i,j,t,o] for {i,j) is defined to be 

P[i,j,t,o]= max {M[i,l] + M[l + 1, j]} (16) 

t<l<o 

Definition 4.2 Let S be an RNA sequence of length n. i, j, t and o are integers that satisfy 
l<i<t<o<j<n, a set of non-crossing base pairs in subsequence ■ j] is a {t, o) partial 
set for (j, j) if it does not contains a base pair formed between 5'[3] and S[h], where i < g < t and 
o < h < j. A {t,o) partial set for (i,j) is optimal if the number of base pairs in the set is the 
maximum of all (t,o) partial sets for (i, j). 

Lemma 4.1 Let S be an RNA sequence of length n. i, j, t and o are integers that satisfy 1 < i < 
t < o < j < n. P[z, j, t, o] is the number of base pairs in an optimal (t,o) partial set for (i,j) may 
contain. 

Proof. We use Ot to denote the set of all (t, o) partial sets for (i, j), L is the optimal (t, o) partial 
set in Ot and Ni is the number of base pairs L contains. We first show that P[i, j, t] > Ni. 

We consider the nucleotides in positions in the subsequence These nucleotides are 

middle nucleotides. If none of the base pairs in L contain a middle nucleotide, we can partition 
the base pairs in L into two disjoint subsets Li and L^, where Li contains the base pairs formed 
among nucleotides in subsequence • • • t] while Lr contains those formed among nucleotides in 
subsequence «S'[t -I-1 • • • j]. We thus have 

Ni < M[i,t] + M[t + l,j] (17) 

< P[i,j,t,o\ (18) 

where the first inequality is due to the fact that M[i,t] and M[t + l,j] are the maximum number of 
non-crossing base pairs that can be formed in subsequences iS'[i • • • z] and S[t + 1 • • • j] respectively; 
the second inequality comes from Definition 14.11 

If at least one base pair in L contains a middle nucleotide. Without loss of generality, we assume 
there exists a base pair formed between a nucleotide in position bp, where i < bp < t, and a middle 
nucleotide. Such a base pair is a left base pair. We find all middle nucleotides contained in left base 
pairs in L and select the rightmost one. We use rt to denote its position. Similarly, base pairs in 
L can be partitioned into two disjoint subsets L/ and Lr where Li contains the base pairs formed 
among nucleotides in subsequence iS'[z • • • r^] and Lr contains those formed among nucleotides in 
subsequence «S'[rt -|- 1 • • • j]. We thus have 

Ni < M[i,rt]+ M[rt + l,j] 

< P[i,j,t,o] 

where the second inequality is due to the fact that t <rt < o. Ni < P[i,j,t,o] thus holds. 

We then show that P[i,j,t] < Ni. From Definition 14.11 there exists It such that P[i,j,t,o] = 
M[i, It] + M[lt + l,j] and t < It < o. M[i, It] and M[lt + l,j] correspond to two sets of base pairs in 
subsequences S[i ■ ■ ■ k] and S')!* + 1 • ■ • j] respectively. We use and Si^+ij to denote them. It is 
clear that and Si^+ij are disjoint and Si^i^ U € Ot. We thus obtain 

P[i,j,t,o] = |5'i,q| + IS'i.+ijI (21) 

< Ni ( 22 ) 

where ]Si^i^ ] and jS'/j+ijj are the number of base pairs in Si^i^ and S'/j+ij- respectively. The inequality 
is due to the fact that L is optimal. Ni = P[i,j,t, o] thus follows. D 


(19) 

( 20 ) 
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It is not difficult to see that Definitions 14.11 and 14.21 provide an approach to computing partial 
optimal results for M[i,j] during the dynamic programming procedure. From Lemma l4.11 it is also 
clear that each (t, o) partial optimal result for (i,j) corresponds to an optimal (t,o) partial set for 
(i, j) in subsequence S[i - ■ ■ j]. The following lemma shows that a property similar to the one shown 
in Lemma [3.II for M[i,j] also holds for P[i,j,t,o]. 

Lemma 4.2 Let i, j, t, and o be integers that satisfy i<t<o<j, we have P[i,j,t,o] < 
P['i - < P[i,j,t,o] + 1 for P[i,j,t,o] and P[i - 

Proof. From Lemma [4.11 there exists an optimal {t,o) partial set Gt for (i — l,j) such that Gt 
contains P[i — non-crossing base pairs. If 5'[i] is in a base pair in Lt, we use pi to denote 

this base pair. It is not difficult to see that Gt — {pi} is a (t,o) partial set for {i,j)- From Lemma 
oi we obtain 

\Gt\-l<P[i,j,t,o] (23) 

where \Gt\ is the number of stems in Gt- Since |Gt| = P[i — l,j, t,o], we immediately obtain 

P[i-l,j,t,o]<P[i,j,t,o\ + l (24) 

If S'[i] is not contained in any base pair in Gt, Gt is a (t, o) partial set for (z,j). From Lemma 
14.11 we obtain 

\Gt\<P[i,j,t,o] (25) 

Similarly, since |G(t)| = P[i — we immediately obtain 

P[i < P[i,j,t,o\ (26) 

Putting two possible cases together, we have 

P[i-l,j,t,o\<P[i,j,t,o\ + l (27) 

On the other hand, there exists an optimal t partial set G( for {i,j) such that G( contains 
P[i,j,t,o] non-crossing stems. It is straightforward to see that G^ is also a (t, o) partial set for 
{i — l,j). We thus can obtain 

|G(| <P[*-l,j,t,o] (28) 

Since \G^\ = P[i,j,t,o], we have P[i,j,t,o] < P[i — The lemma thus follows. □ 

Lemma 14.21 provides the foundation of our algorithm. Specifically, for an aggregate interval P , 
where 0 < s < fc — 1, and fixed integers j and t such that (s-|-l)r(; < t < j. The values of 
P[{s + l)w,j,t,j — l],P[(s -I- l)w — — I],--- ,P[sw + — 1] can be represented by 

P[sw + 1, j, t,j — 1] and a w-dimensional binary vector Vr- Such a binary vector is a {j, s, t) partial 
right difference vector. 

In addition, consider two aggregate intervals , Is 2 and a fixed integer j such that si < S 2 and 
(s 2 + l)w < j- The values of P[(si -I- l)w,j, S 2 W + 1, (s 2 + 1)^], -P[(si + !)■(« — 1, j, S 2 W + 1, (s 2 + 
l)r(;], • • ■ , ^[siw -I- 1, j, S 2 W + 1, (s 2 + 1)'R'] can be represented by P[siw + l,j, S 2 W + 1, (s 2 + l)w] 
and a w-dimensional binary vector 14- Such a binary vector is a {j, si, S 2 ) partial central difference 
vector. 

Definition 4.3 Let S be an RNA sequence of length n. i, j, t and o are integers that satisfy 
l<i<t<o<j. The {t, o) partial deviation for {i,j) is defined to be P[i,j, t, o]—M[i, o]—M[o-|-l, j]. 

Definition 4.4 Let S be an RNA sequence S of length n. A partial right table Rp is an n x (A: — 1) 
table such that i?p[j, s] stores the integer encoded by the {j,s,t) partial right difference vector for 
some integer t that satisfies (s -I- l)u> < t < j and the corresponding {t,j — 1) partial deviation for 
{{s + l)w,j). 

Definition 4.5 Let S be an RNA sequence of length n, p, p, ■ ■ ■ , P-i are the aggregate intervals 
of length re in S'. A central partial table Gp is a. k x k x 2 “+^ table such that Gp can be accessed with 
three integers Si, S2 and that satisfy Q < si < S2 < k and 0 < Uc < 2 ’"+^. Gp[si,S 2 j^c] stores two 
positive integers whose values are at most w. 
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Lemma 4.3 Let S be an RNA sequence of length n, /q, Ii, • • • , Ik-i are the aggregate intervals of 
length w in S. Given integers si, S 2 and j that satisfy 0<si<S2<fc — 1 and (s 2 + l)w < j. 
The {j, si, S 2 ) partial central difference vector and the value of {s 2 W + 1, (s 2 + l)^f) partial central 
deviation for ((si + l)w,j) can be computed in time 0{w^) if the following conditions hold. 

1. The (j, S 2 ) right difference vector is available; 

2. for each js that satisfies S 2 rc + 1 < js < (s 2 + l)w, the (js, si) right difference vector is available; 

3. the ((si + S 2 ) left difference vector is available; 

Proof. We use U, X to denote the (j, S 2 ) right difference vector and the ((si + l)w,S 2 ) left 
difference vector respectively. In addition, Pi, V 2 , • • • ,14; denote the right difference vectors for all 
positions in Is2 ■ Specifically, for 1 < to < w, Vm is the {s 2 'w + m, si) right difference vector. We show 
how the (j, si, S 2 ) partial central difference vector can be computed from U, X and Pi, V 2 , • • • , Vm- 
We assume W = M[(si + 1 ) 11 ;, (s 2 + l)w] + M[{s 2 + l)rc + 1, j]. Given two integers Sf,, Sc such 
that 1 < Sb < w and 1 < Sc < w. We define J{sb,Sc) as follows. 

J{Sb, Sc) = M[siW + Sb, S 2 W + Sc] + M[s 2 W + Sc + 1, j] (29) 

It is not difficult to see that J{sb, Sc) can be computed from IP and U, X and Pi, P 2 , • • ■ , Pm as 

follows. 

W — 1 

J{Sb,Sc) = M[(si + l)w,S 2 U' + Sc] + Vsail) + M[s 2 W + Sc + l,j] 

I — Sf, 

W—1 w 

= M[{si + l)w,S 2 W + Sc] + M[(s 2 + l)w + l,j] + E 

Z^Sf, Z —S(;.~)“l 

W W — 1 W 

= W- Y. x{i) + YVsM)+ E 

Z = Sc + l Z = Sf, Z = Sc + l 

From Definition 14.11 P[siw + Sb,j, S 2 W + 1, (s 2 + l)w] can be computed from J(sb, Sc) as follows. 

P[siri; + S6,4 S 2 W + 1, (s 2 + l)w] = max {J(s6,Sc)} (33) 

IKsc^w 

The computation of J(st,,Sc) needs 0{w) time and there are in total 0{w'^) pairs of (sb,Sc)’s. The 
(j, si, S 2 ) partial central difference vector and the value of {s 2 W + 1, (s 2 + l)'ic) partial deviation for 
((si + l)w,j) can thus be computed in 0{w^) time. D 

From Lemma |4.31 the content of Cp can be precomputed for each combination of a pair of two 
different aggregate intervals 4^, Is^ that are both to the left of j and a w-dimensional binary vector 
X that represents the (j, S 2 ) right difference vector. The integer encoded by the corresponding partial 
central difference vector and the value of the corresponding partial central deviation can be stored 
in Cp. After the precomputation of Cp is performed. The algorithm can query Cp to obtain the 
integer encoded by the (j, si, S 2 ) partial central difference vector and the value of (s 2 W +1, (s 2 + l)rc) 
partial central deviation for ((si + l)i/;,j) in 0(1) time. However, the precomputation of Cp must 
be interleaved with the computation of M[z,j]’s during the dynamic programming procedure. 

Definition 4.6 Let S be an RNA sequence S of length n. An updation table Up is a four-dimensional 
table and two integers are stored in a location Up[di, d 2 ,ui,U 2 ], where di, ^ 2 , Ri and U 2 satisfy 
0 di < n, 0 < d 2 < 2n, 0 < ui < and 0 < U 2 < . One of the integers stored in a 

location in Up is at most 2’"+^ — 1 and the other one is at most n. 

Lemma 4.4 Let S be an RNA sequence of length n, /q, /i, • • • , Ik-i are the aggregate intervals of 
length w in S. Given integers Si, S 2 and j that satisfy 0<Si<S2<A: — 1 and (s 2 + l)w < j, the 
(j) Si; S 2 W + 1) partial right difference vector and the value of the (s 2 W + 1, j — 1) partial deviation 
for ((si -I- l)w,j) can be computed in time 0{w) if the following conditions hold. 


(30) 

(31) 

(32) 
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1. The (j, S 2 , (s 2 + l)w + 1) partial right difference vector is available; 

2. the ((s 2 + l)w + 1, j — 1) partial deviation for ((si + l)w,j) is available; 

3. the {j, 81 , 82 ) partial central difference vector is available; 

4. the { 82 W + 1, (s 2 + l)u>) partial deviation for ((si + l)w,j) is available; 

5. the values of M[(si + l)w,j — 1], M[{si + l)w, (s 2 + 1)H) M [{82 + l)w +1, j] are available. 

Proof. We use U to denote the (j, S2,(s2 + l)w + 1) partial right difference vector. di denotes 
the ((s 2 + l)w + 1, j — 1) partial deviation for ((si + l)w,j)-, V denotes the (j, si, S 2 ) partial central 
difference vector, dr denotes the { 82 W + 1, (s 2 + 1)'U') partial deviation for ((si + l)w,j). 

For an integer is such that 1 < is < w, we use to denote P[ 8 iw + is, j, (s 2 + l)w+ l,j — 1] 

and A 2 [is] to denote P[siw + is,j, { 82 W + 1, (s 2 + 1)r']- It is not difficult to see that the values of 
Ai[is] and A 2 [is\ can be computed based on U, V, di and ^2 as follows. 

W—1 

Ai [f«] = M[(si + l)w,j -l]+di + J 2 (34) 

I—is 

w—1 

^2 [is] = M[{8i + I)?!!, (S2 + l)w] + M [{82 + l)w + l,j] +dr+'^V{l) (35) 

I'^is 

Since the values of M[(si + l)w,j — 1], M[(si + l)w, (s 2 + l)rc], and M[(s 2 + l)w + l,j] are 
available, we are able to compute the values of and ^ 2 [*s]- It is straightforward to see that 

the following equality holds. 

P[(siw + is,j,S2W + l,j - 1] = max{yli[zs],^2[fs]} (36) 

The (j,si,S 2 W + l) partial right difference vector and the value of the {s 2 W+l, j — 1) partial deviation 
for ((si + l)w,j) can thus be computed in time 0{w). D 

From Lemma 14.41 the content of Up can be precomputed for each possible combinations of U, 
V, di, and M[{si + l)w, (s 2 + l)ui] + M[{s 2 + l)w + l,i] + dr — M[{si + V)'w,j — 1]. For each 
such combination, the corresponding partial right difference vector and partial deviation can be 
computed and stored in Up. Specifically, for each such combination, we use e^, to denote the 
integers encoded by the binary vectors in U and V and let 


di = di (37) 

d 2 = M[(si + l)w, (s 2 + l)w] + M[(s 2 + l)w + 1, j] + d 2 — M[(si + l)w,j — 1] + n (38) 

It is clear that 0 < di < n and 0 < d 2 < 2n. The integer encoded by the computed partial right 
difference vector and partial deviation are then stored in Up[di,d 2 ,eu,ey]. The precomputation of 
Up is performed before the dynamic programming procedure starts. 

Lemma 4.5 Let S be an RNA sequence of length n, /q, /i, • • • , Ik-i are the aggregate intervals of 
length w in S. Given integers s and j that satisfy 0 < s < fc — 1 and (s + l)w < j, the {j, s) right 
difference vector can be computed in time 0{w‘^) if the following conditions hold. 

1. The (j, 8, (s + l)w + 1) right partial difference vector is available; 

2. the ((s + l)w + l,j — 1) partial value for ((s + l)w,j) is available; 

3. the value of M[i,j] is available for each i that satisfies (s + l)w + 1 < z < j and its value can 
be obtained in 0(1) time; 

4. the value of M\i,jt\ is available for each jt that satisfies 1 < i < jt < j and its value can be 
obtained in 0(1) time. 
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Proof. We use U to denote the (j, s, (s + l)w + 1) right partial difference vector, dp to denote 
the ((s + l)?c + 1, j — 1) partial value for ((s + l)w,j); The values of M[i,j] for each i in Is can be 
computed with the following steps. 

1. Set i to be (s + l)w; 

2 . exhaustively enumerate each integer q that satisfies i < q < (s+l)r(; and compute the maximum 
value of M[i, q] + M[q + 1, j] and set ci to be this maximum value; 

3. set C 2 to be M[i,j - 1] + d; + 

4. set C 3 to be M[i — l,j — 1] + 

5. set C 4 to be M[i — 1, j]; 

6 . set C 5 to be M[i,j — 1]; 

7. set AI[i,j] to be max{ci, C 2 , C 3 , C 4 , C 5 } and store the result appropriately in memory; 

8 . decrement i by 1 ; 

9. if f > sw, go to step 2; 

10. compute the {j,s) right difference vector based on the values of M[i,j]’s that have been 
computed for su> + 1 < f < (s + 

It is clear that the above algorithm correctly computes the values of M[i,j]’s for all i’s that 
satisfy sw + 1 < f < (s + l)w. Since the time needed to compute M[i,j] for a single i is 0{w), the 
(j, s) right difference vector can be computed in time 0{nP'). The lemma thus follows. Cl 

In addition to tables Lt, Rt that have been defined in Section [3] and all tables defined in this 
section, the algorithm uses an n x n table Mr to store the value of M\i,j] for each i and j that 
satisfy 1 < i < j < n. The steps of the algorithm can be sketched as follows. 

1. Precompute table Up as described above; 

2 . set j to be 1 ; 

3. set s to be | — |; 

4. for each value i in R and i < j, compute M[i,j] and store its value in Mr[i,j]; 

5. for each value i in Jq, /i, • • • , R-i, compute P[i,j, sw + l,j — 1]; 

6 . for each h that satisfies 0 < h < s — 1 , compute the (j, h, sw + 1 ) partial right difference vector 
Ufi and the [sw + 1 , j — 1 ) partial deviation dh for {{h + l)w,j), store the integer encoded by 
Uh and dh into h]; 

7. set y to be s — 1; 

8 . compute the [j, y) right difference vector Vt with the algorithm in Lemma 14.51 and store the 

integer encoded by Vt into Rt[j, y]; store the value of M[i,j] into Mr[i,j] for each value i in ly 

as well; 

9. set y to be y — 1; 

10 . compute the [j,g,yw + 1 ) partial right difference vector Ug and the [yw + l,j — 1 ) partial 
deviation dg for ((y + l)w,j), store the integer encoded by Ug and dg into Rp[j,g]; 

11 . decrement y by 1 ; 

12 . if y > 0 holds, go to step 10 ; 
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13. decrement y by 1; 

14. if t > 0 holds, go to step 8; 

15. if j is the last position of /g, update L* to store the integer encoded by {i,s) left difference 
vector into Lt[i, s] for each i that satisfies 1 < i < sw] 

16. if j is the last position of Is, for each q that satisfies 0 < q < s, precompute the content of the 
locations in Cp for pair {q, s); 

17. increment j by 1; 

18. if j < n holds, go to step 3; 

19. use the trace-back algorithm developed in Section [3] to find the base pairs in the secondary 
structure and output them. 

In steps 4, 5 and 6, the algorithm exhaustively enumerates all integers q that satisfy sra-l-l < q < j 
for each i and computes the maximum value oi M[i,q] + M[q+l,j] and stores the value into [*, j] 
if i is in Ig. The value is stored in an array if j is not in Ig. For each h that satisfies 1 < /i < s — 1, 
the (j, h, SW + 1) partial right difference vector Ut and (sic-I-1, j — 1) partial deviation dh for (hw,j) 
can both be computed from N. The integer encoded by Uh and dh are then stored into table Rp at 
[j I • 

In step 8, the algorithm obtains the {j,y, {y + l)w -\- I) partial right difference vector and the 
value of the corresponding ((j/ -|- l)w; -|- 1, j — I) partial deviation for ((y -|- l)w,j) by querying the 
table Rp with j and y, it then computes the {j, t) right difference vector using the algorithm in the 
proof of Lemma IT751 This step finalizes the computation of M[i,j] for each i that is in ly. 

In step 10, the algorithm queries Rp with j and g to obtain the integer encoded by the (j, g, {y -\- 
l)ic -I- 1) partial right difference vector Ug and the {{y l)i(; l,j — I) partial deviation dy for 

((y -I- l)w,j). It then queries Rt with j and y to obtain the integer encoded by {j, y) right difference 
vector and Lt with {g + l)i(; and y to obtain the integer encoded by the ((y -I- l)w,j) left difference 
vector. We use Cy and fg to denote these two integers respectively. It then queries Cp with g, y, 
By, and fg to obtain the integer encoded by the ij,g,y) partial central difference vector Vg and the 
corresponding {yw -|- I, (y -|- l)u;) partial deviation da for ((y -|- l)w,j). We use Ug, Vg to denote the 
integers encoded by Ug and Vg respectively, the algorithm computes two other integers as follows. 

di = dy (39) 

d 2 = Mr[{g + l)w, (y -k I)w] -k Mr[{y + I)iy -k 1, j] - M[{g + l)w,j -l]+da + n (40) 

It then queries Up to obtain the two integers stored at Up[ug,di,Vg,d 2 \. One of these two integers 
is the integer encoded by the {j,g,yw -1-1) partial right difference vector and the other one is the 
corresponding {yw -\-l,j — 1) partial deviation for ((y -|- l)w,j). We store both of them into Rp at 
Rp[j,9]- 

In steps 13 and 14, the algorithm updates the table Lt if j is in the last position of Ig. For each 
i that satisfies 1 < i < sw, the integer encoded by the (i, s) left difference vector is stored in Lt 
at Lt[i,s\. In addition, since the values of all M[i,y]’s for each j in Ig have been computed, the 
algorithm performs the precomputation needed to determine the content of the locations associated 
with each pair (y, s) in Cp, where 0 < y < s. 

Theorem 4.1 Given an RNA sequence of length n, our algorithm solves the basic RNA folding 
problem in 0 (p^|t^) time and 0{n^-^) space. 

Proof. We show that our algorithm correctly solves the basic RNA folding problem in 0(p;p^) 

time and 0(n^ ®) space. Steps 4, 5 and 6 in the algorithm guarantee the correctness of the computed 
values for M[*,y]’s and P[i,j, sw -|- 1, j — l]’s since an exhaustive search is performed on all integers 
between su> -I- 1 and y — 1 in these steps. 
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The correctness of the (j, y) right difference vector computed in step 8 comes from Lemma 14.51 
and the fact that the integer encoded by the (j, y, {y + l)w + 1 ) partial right difference vector and 
the value of the corresponding ((y + l)r(; + l,j — 1) partial deviation for ((y + l)w,j) have been 
correctly computed when /j,+i is processed. 

Lemma l473l and Lemma |T4] guarantee that the integer encoded by the (j, y, yw + 1) partial right 
difference vector and the corresponding {yw + l,j — 1 ) partial deviation for ((y + can be 

correctly computed based on a few intermediate computational results. These results include the 
integer encoded by the (j, y, (y + l)ic + 1 ) partial right difference vector, the ((y + l)w + l,j — 1 ) 
partial deviation for ((y + l)w,j), the integer encoded by the ((y + l)w,y) left difference vector, 
the integer encoded by the (j, y) right difference vector, and the values of M[{s + l)i(;, (y + l)w], 
M[{y + l)ri; + l,j], and M[{g + l)w,j — 1]. It is straightforward to see that all these results have 
been correctly computed and stored in memory before step 10 is executed. Finally, steps 13 and 14 
correctly update Lt and Cp when j is in the last position of Is- The correctness of the algorithm 
thus follows. 

From Lemma 14.41 the precomputation of Up needs time. From Lemma 14.31 the 

precomputation of Cp for a given pair of aggregate intervals need 0{w^2‘^'^) time. Since there are 
O(^) pairs for which Cp needs to be computed, the total amount of time needed to precompute Cp 
is thus at most 0(n^w2^™). For each j, the algorithm needs 0{nw) time in steps 4, 5 and 6 . Step 
8 is executed only once for each aggregate interval for each j and it needs time 0{w‘^) from Lemma 
14.51 Step 10 can be executed in 0(1) time and it is executed for O(^) times for each j. Step 13 
can be completed in time 0(n) for each j. From Theorem 13.21 the last step needs O(n^) time. The 
total amount of computation time T(n) needed by the algorithm thus must satisfy 

r(n) < A(n^w^2^'" + H- 5 -+ (41) 

where A is some positive constant. Let w = ^ log 2 n and the time complexity of the algorithm is 
thus O(j^). 

For space complexity, Lt, Rt and Rp need O(^) space; Up needs 0(n^2^“') space and Cp needs 
0( "^2 ) space; Mj. needs 0{n'^) space. It is straightforward to see that the space complexity of the 
algorithm is 0{n^'^) when w — j log 2 n. The theorem thus follows. D 

It is worth pointing out that we can reduce the space complexity of the algorithm to 0(n^+'^) for 
any e > 0 by setting w = 5 elog 2 n. Although this does not change the asymptotic time complexity 
of the algorithm, a large hidden constant factor is introduced to the time complexity and doing this 
thus may not be desirable in practice. 


5 Extend to the Context-Free Language Recognition Prob¬ 
lem 

In this section, we consider the worst-case space complexity of the context-free language recognition 
problem. The goal of the problem is to determine whether a string is in the language defined by a 
context-free grammar. The problem can be solved in time 0{grA) with the well known CYK algo¬ 
rithm, where y is the number of nonterminals in the grammar and n is the length of the string. This 
algorithm shares the same recursive structure as the traditional dynamic programming algorithm 
for the basic RNA problem. 

In [15], a subcubic algorithm is developed to solve the problem based on a fast algorithm for 
boolean matrix multiplication. Since the algorithm is complex and difficult to implement in practice, 
a few other algorithms that can solve the problem in o{rA) time were developed afterwards. For 
example, in |7], an 0{ ^^ time algorithm is developed to solve the problem using the Four- 
Russians Technique. In US], Compression techniques are used to obtain an time algorithm 

for the problem. In [5] , a reduction from the boolean matrix multiplication problem to the problem 
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is established and it is shown that the time complexities of the two problems are in fact inherently 
related. 

Although the worst-case time complexity of the problem has been improved to o(n^) or even 

subcubic time. The worst-case space complexity for efficiently solving the problem remains to be 

0{n^). In [2], a lower bound of 0(log2n) is established for the space complexity of this problem. 

However, it remains unknown whether the problem can be efficiently solved in o(p?) space. As has 

been observed in previous work, the dynamic programming tables for this problem contain only 0 

or I’s. The memory efficient algorithm we have developed in Section |4] can thus be slightly modified 

2 

to efficiently solve the problem in space. 

Corollary 5.1 The context-free language recognition problem can be solved in time while 

using only space, where g is the number of nonterminals in the grammar and n is the 

length of the string. 

6 Conclusions 

In this paper, we develop new algorithms for the basic RNA folding problem. Based on the Four- 
Russians Technique, we show that the problem can be solved with a traditional dynamic program¬ 
ming algorithm in time while using only space, where n is the length of the 

sequence. In addition, we show that the Four-Russians Technique can be further utilized to solve 
the problem in time using only traditional dynamic programming techniques. Our work 

also improves the worst-case space complexity for efhcienlty solving the context-free language recog¬ 
nition problem to ^ ), where g is the number of nonterminals in the grammar and n is the 

length of the string. 

In [15], the Valiant’s technique is used to solve the context-free language recognition problem in 
subcubic time. A question therefore arises naturally on whether subcubic time solutions also exist 
for the basic RNA folding problem. The techniques developed in [15] are based on boolean matrix 
multiplications and thus cannot be directly applied to the basic RNA folding problem. Although a 
reduction from the basic RNA folding problem to the stochastic context-free parsing is available [5] , 
an efficient reduction that reduces the problem to the context-free language recognition problem is 
still unavailable. The worst-case time complexity of this problem thus remains to be an interesting 
problem and deserves additional investigations in the future. 
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